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Abstract 

Massive multiplayer online role-playing games (MMORPGs) are very popular in past few years. The profit of an 
MMORPG company is proportional to how many users registered, and the instant number of online avatars is a key 
factor to assess how popular an MMORPG is. We use the on-off'-line logs on an MMORPG server to reconstruct the 
instant number of online avatars per second and investigate its statistical properties. We find that the online avatar 
number exhibits one-day periodic behavior and clear intraday pattern, the fluctuation distribution of the online avatar 
numbers has a leptokurtic non-Gaussian shape with power-law tails, and the increments of online avatar numbers 
after removing the intraday pattern are uncorrelated and the associated absolute values have long-term correlation. In 
addition, both time series exhibit multifractal nature. 
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1. Introduction 

A massive multiplayer online role-playing game (MMORPG) is a genre of online role-playing games (ORPGs) in 
which a large number of players interact with one another within a virtual world. The term MMORPG was coined by 
Richard Garriott, who created Ultima Online. In mainland China, Shanda Interactive Entertainment Ltd is the leader 
of the MMORPG industry, which is based in Shanghai. Shanda runs dozens of online games and has most registered 
players. 

An MMORPG forms an online virtual world, where people can work and interact with one another in a somewhat 
realistic manner Therefore, virtual worlds have great potential for research in the social, behavioral, and economic 
sciences jltl. For instance, we can design a kind of virus in a virtual world and let it spread to investigate its epi- 
demics, we can design some economic games in a virtual world to study the formation of human cooperation (indeed, 
numerical experiments have been done and we can record the economic behaviors of avatars to understand the 
evolution of wealth distribution. A pioneering work was done by Edward Castronova, who traveled in a virtual world 
called "Norrath" and performed preliminary analysis of its economy yn. Recently, there are also efforts in the field 
of computational social sciences from a complex network perspectiveu3> El S 1^ IM 1^- In addition to its scientific 
potentials, virtual worlds could act as nice places for real social activities, such as marketing lEElIil, and provide 



opportunities for players to make real money II13I1 . 
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In this work, we investigate the behavior of instant online avatar numbers in a server of a very popular MMORPG. 
The number of instant online avatars is of crucial importance for scientific and commercial purposes. The paper is 
organized as follows. In Section |2l we describe the data and the procedure to construct the time series of instant 
online avatar numbers. Section [3] studies the seasonal patterns of the time series. The probability distribution of 
the fluctuations of online avatar numbers is researched in Section H) and the temporal correlations and multifractal 
properties are analyzed in Section|5] Section|6]summarizes the main findings of this paper. 



2. Data preprocessing 

The MMORPG game investigated is called "Legend of Mir", which is copyrighted and run by Shanda Interactive 
Entertainment Ltd in Shanghai. This online game was very popular in China several years ago. 

An avatar is activated when a player logs on an MMORPG server When he quits the game, the server records an 
entry including the time moments of his log-on and log-off, accurate to one second. An on-off-line log is saved at the 
end of each day. This allows us to reconstruct the number of online avatars n(t) at each second t. The data we analyzed 
are from 1 September 2007 to 31 October 2007. The variable is divided by its mean in the presentation of this paper, 
which does not change the results. A segment of online avatar numbers is plotted in Fig.[T](a). It is not surprising that 
there is a daily periodic pattern in the evolution of online avatar number n(t). The number n(t) has an intraday low at 
around 7:30 and then increases in the following daytime, with a plateau from 12:00 to 18:00. At around 20:00, n{t) 
reaches its maximum value. This pattern repeats every day. 
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Figure 1: (color online) The online avatar numbers «(f) for (a) a continuous period of five days and (b) five separate days. 



Fig. [lib) illustrates the plots of online avatar numbers on five days including 10 September 2007, 25 September 
2007, 5 October 2007, 25 October 2007, and 31 October 2007. One finds that the curves almost share the same shape 
except for 25 September 2007 and 31 October 2007. For 25 September 2007, the online avatar number fell rapidly 
and remained zero between 00:30 and 01:30 in the morning. This phenomenon is observed for several days in the 
sample, which is due to the fact that the server was scheduled for maintaining or game version updating after midnight. 
However, Fig. [T] suggests that the maintaining time is better to be around 7:00 (say, from 6:30 to 7:30) in the morning 
in order to impact less players. For 31 October 2007, there was a sharp decease of the online avatar number at the 
end of the day. This reflects a finite-size effect or a boundary effect. Note that our data set is truncated at 23:59:59 
on 31 October 2007 and the on-off-line logs are recorded based on the log-off time. This means that the logs of 31 
October 2007 exclude the situation that the avatar went online before 23:59:59 and offline after 23:59:59. Therefore, 
the online avatar numbers in this last day are excluded from our analysis, and the resultant data set has 60 days. 
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3. Seasonal patterns 



3.1. Spectral analysis 

Because of the circadian and weekly cycles of human activity, evident periodicity is observed in the time series 
of online avatar numbers. A spectral analysis is adopted to quantify the periodic behavior. Fig. |2a) illustrates the 
power spectrum of online avatar number series. The units of the frequency / is 1/day. The highest peak lies at 
/ - 0.0167, which captures nothing but the weak global trend of the time series 1 14 1. The second highest peak locates 
at fi = 1.0167, which is statistically significant with a p-value of 3.41 x lO"'*^ lIlSll . It implies that the periodicity is 
about one day, as suggested by Fig. [Ha). We also see harmonic peaks around f - 2,3, which further confirms that 
the observed one-day periodicity is not artificial. 
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Figure 2: Spectral analysis of online avatar numbers by means of fast Fourier transformation: (a) Power spectrum; (b) Fundamental frequency. 



In order to determine the fundamental frequency /o using harmonic peaks, we can regress the following equation 
between the harmonic frequencies fi and the corresponding orders of the associated peaks, lfl6i[l7 , 181. 



fi^a + ix fo. (1) 

We choose the peaks that are significant higher than its neighboring spectral powers in Fig. I2a), and five peaks at 
/i = 1.0167, /2 = 2.0167, /3 = 3.0167, /4 = 4.0167, and fi = 7.0167 are determined. The plot of / against i shown 
in Fig.|2b) exhibits nice linearity. A linear least-squares regression gives a = 0.0167 and /o = 1.000. The F-test finds 
the relation ([TJ significant with a p-value of zero. The t-test shows that the coefficient a is significantly different from 
zero and the hypothesis /o = 1 can not be rejected at the significance level 0.0001 . This fundamental frequency fo - I 
corresponds to an exact one-day periodicity, which is the base for the search of a possible intraday pattern. 



3.2. Intraday pattern and weekly pattern 

In order to investigate the seasonal patterns in the time series, we define the average online avatar number, calcu- 
lated as follows, 

jeD 

where D stands for the set of #!D days under consideration and nj{t) represents the online avatar numbers on j-th day 
of D. For instance, if we need to estimate the intraday average online avatar numbers for all the holidays, D is the set 
containing all the holidays (Saturday, Sunday, and public holidays) in the two corresponding months. Note that the 
periods that have vanishing online avatar numbers are excluded from the averaging procedure. 

As a first step, we partition all 60 days into two groups, one containing all working days and the other including all 
weekends and public holidays. The intraday patterns of these two groups of days are shown in Fig.|3la). The average 
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online number grows gradually after 7:30 in the morning. During the time period between 12:00 and 18:00, the online 
number is almost stationary. After 18:00, the online number increases again before around 20:00. Then, the number 
drops till 7:30 in the next day. The trend of online avatar number is consistent with the circadian rhythm of human 
activities. In addition, the average online avatar number for holidays is larger than that for working days, which 
is trivial. For working days, we also calculate the intraday patterns for Monday, Tuesday, Wednesday, Thursdays, 
and Friday, which are presented in Fig. [3lb). Roughly speaking, the five curves almost overlap and no remarkable 
difference is revealed among these days. However, a careful scrutiny unveils that the average number curve for Friday 
keeps above other days from noon to midnight. The pattern in Friday evening is explained by the fact that Friday is 
followed by Saturday and most of the players are free on Saturday, while that in the Friday afternoon is explained 
by the fact that most college students do not have courses and many official institutions have much less work to do, 
for instance, only a small part of the officials might have meetings. This Friday afternoon pattern is expected to be 
idiosyncratic for MMORPGs played mainly by Chinese. 
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Figure 3: Seasonal patterns of online avatar numbers: (a) Intraday pattern for working days, and holidays (including Saturday, Sunday, and public 
holidays); (b) Intraday pattern for Monday, Tuesday, Wednesday, Thursday, and Friday (consider only the sample of working day); (c) Dependence 
of rid as a function of /; (d) Weekly pattern. 



In order to further distinguish the differences among the intraday patterns of Monday, Tuesday, Wednesday, Thurs- 
day, and Friday, we define that 

ridit) = («,(0) - <«,.(0>, (3) 

where i - 1 , 2, 3, 4, 5 stand for the five kinds of days (Monday, Tuesday, Wednesday, Thursday, and Friday), {yii{t)) is 
the intraday pattern of each kind of weekdays, and (n^t it)} represents the intraday pattern of working days shown in 
[3ta). Fig.[3tc) depicts the dependence of as a function of t. One can observe that the online numbers of Mondays 
(Fridays) are much larger than those of the four other days before (after) 12:00. This suggests a weak weekly pattern, 
which is illustrated in Fig.[3td). However, this weekly pattern is very weak and considering only the intraday pattern 
is sufficient for most quantitative analyses. 
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4. Probability distributions of A« 



We now study the probability distribution of the fluctuations of online avatar numbers An, which is defined as the 
diff'erence of online avatar numbers n{f) in two successive seconds. 



A«(f) = «(r) - n(t - 1). 



(4) 



Fig.HJa) illustrates the empirical density function of An in log-linear scales. It is seen by eyeballing that the distribu- 
tion has a leptokurtic fat-tailed non-Gaussian shape. The non-Gaussian feature of the fluctuation distribution can be 
characterized by the QQ-plot shown in Fig.SJb). To investigate the tails, we show in Fig.Hfc) the survival function of 
|A«|, An > 0, and An < 0. The method proposed by Clauset, Shalizi and Newman lfl9ll has been employed to confirm 
that the distributions have power-law tails. 



when X ^ x^in. We find that j - 3.24 and x„ 
and jCmin = 4 for x = An < 0. 



C{x) ~ x-^ 
: 4 for X = |An|, y 



3.38 and x„ 



(5) 

6 for X = An > 0, and y = 2.78 
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Figure 4: (a) Probability density of the fluctuations of online avatar numbers An. (b) QQ plot of An. (c) Survival distribution of An. The curves of 
Aji > and An < have been translated vertically by a factor of 0.1 and 0.001 in turn for better visibility. 



5. Long memory and multifractality 

In this section, we study the memory effect in the online avatar number time series n(f). As shown in Section [3l 
there is an evident intraday pattern in the time series. This periodic pattern is removed before correlation analysis for 
each day j, which results in a new time series; 

n,(t)^nj{t)/{n}, ; = 1, 2, ■ ■ ■ , 60. (6) 

The fluctuation of ririt) is defined as follows 

An,(f) = n,(f) - n,(f - 1). (7) 

We stress that, Anr(f) is the right quantity to check the memory effect of n(f), but investigating its distribution is of no 
interest and seems meaningless. 



5.1. Long memory 

The detrended fluctuation analysis (DFA) is utilized, which has the ability to extract long-range power-law cor- 
relation in non-stationary time series |23, 21]. For a given intertrade duration series {Anr(t)\t - 1, 2, ■ ■ ■ , T}, we can 
define the cumulative summation series ^(f) as follows. 
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(8) 



t'=i 



Note that the mean of the A«,.(f) series is not removed before the cumulative summation, which will be removed in 
the detrending step. The series y(t) is covered by A^, disjoint boxes with the same size s. When the whole series y{t) 
cannot be completely covered by Ns boxes, we can utilize 2Ns boxes to cover the series starting from both ends of 
the time series. In each box, a cubic polynomial trend function g of the sub-series is determined. The local detrended 
fluctuation function fi^is) in the A:-th box is defined as the r.m.s. of the fitting residuals: 



1 



ks 



f=(A--l)s+l 

The overall detrended fluctuation is estimated as follows 



Yj bit)-g{t)f 



(9) 



(10) 



As the box size s varies in the range of [100, T/4], one can determine the power-law relationship between the overall 
fluctuation function F2is) and the box size s, which reads. 



F2(s) ~ i^. 



(11) 



where H signifies the Hurst index, which is related to the power spectrum exponent 77 by 77 = 2H - 1 11221 12311 and to 
the autocorrelation exponent 7 by y = 2 - 2H 11241 12111 . 

Fig.[5]illustrates the dependence of the overall fluctuation function F2is) of An,.(f) with respect to the box size s 
in double logarithmic coordinates. There is a nice power-law relation spanning more than two orders of magnitude. 
A linear least-squares regression of In F2{s) against In s gives the estimate of the Hurst index H - 0.481 +0.005. This 
value is very close to H - 0.5 and we argue that there is no temporal correlation in the time series of An, (f). We also 
show the DFA results for |A«,(f)| in Fig.|5] Again, we see a nice power law and the exponent is // = 0.868 + 0.012. 
In other words, there is evident long memory in the absolute fluctuations |A«,(f)| of online avatar numbers. This 
observation is reminiscent of the behavior of stock returns ll25ll . 
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Figure 5: (color online) Detrended fluctuation analysis of A«,-(?) and |A;i, (01- The solid lines are the best power-law fits to data in the corresponding 
scaling ranges. The hurst indexes are H = 0.481 ± 0.005 for An,, and H = 0.868 ± 0.012 for |A;i,-|. 
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5.2. Multifractal nature 

The DFA method can be extended to detect multifractal nature, known as the multifractal detrended fluctuation 
analysis (MF-DFA) ll26ll . The overall detrended fluctuation in Eq. (fTOl i is generalized to the following form 

{ \ \ 

where q can take any real number except ^ = 0. When ^ = 0, we have 

Fo(5) = exp|^gln[/,W]|. (13) 

By varying the value of s in the range from imin - 100 to imax - T/4, one can expect the detrended fluctuation 
function Fq{s) scales with the size s: 

F,{s) ~ (14) 

where h{q) is the generalized Hurst index. Note that when q -2, h(2) is nothing but the Hurst index H. We focus on 
q e [-4, 6] to obtain reasonable statistics in the estimation of Fg{s). 

The overall detrended fluctuations Fq{s) are plotted as a function of s in log-log scales in Fig.|6]for An^ and lAn^l 
and different orders q. The power-law relation ( fT4l i is verified for all the curves, with the scaling ranges wider than 
two orders of magnitude. An anomalous feature is observed in Fig. |6ja) for An,, that the F-2{s) curve is flatter than 
the Fq{s) curve. It means that h{-2) < li{Q), which is not common even for nonconservative quantities in multifractal 
analysis. This anomaly is not observed in Fig.|6jb) for |A«,-| 




Figure 6: (Color online) Dependence of the overall fluctuation functions F^(s) with respect to the box size i for q = -2, 0, 2 and 4 in log-log 
coordinates for (a) An, and (b) |Aji, |. 



The scaling exponent function T(q), which is used to reveal the multifractality in the standard multifractal formal- 
ism based on partition function, can be obtained numerically as follows: 

T(q)^qh(q)-Df, (15) 

where D j is the fractal dimension of the geometric support of the multifractal measure (in the current case we have 
Df =1). The local singularity exponent a and its spectrum f{a) are related to T(q) through the Legendre transforma- 



tion \m 



a - dT(q)/dq 

(16) 

fia) ^qa- T{q) 
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Since the size of each time series is finite, the estimate of Fg{s) will fluctuate remarkably for large values of |^|, 
especially for large s. 

Fig.|7ta) illustrates the generalized Hurst indexes h(q) as a function of q for An, and |A«rl- We find that the h(q) 
function for | A«, | decreases monotonically, while that for Aiir increases in the left part and decreases in the right part as 
indicated by Fig.|6ta). Similar phenomenon is rare with only a few examples 12811 . Fig.|3b) shows the corresponding 
scaling exponent functions T{q). The nonlinearity in T{q) is a hallmark for the presence of multifractality in the time 
series. Fig. |7lc) presents the two multifractal singularity spectra f{a) for the two time series. Again, the spectrum 
f{a) for Aiir exhibits abnormal behavior 




31 



(b) 






-B-An 




r 




-^lA/i 1 




r 



2 




Figure 7: (Color online) Multifractal analysis of An, (CI) and |A;i, | (». Shown are the generalized Hurst indexes h(q) (a), the mass exponents T(q) 
(b), and the multifractal spectra f(a) (c). 



6. Conclusion 

In summary, we have investigated the statistical properties of the time series of instant number of online avatars 
in a massive multiplayer online role-playing game. Spectral analysis shows that the online avatar number exhibits 
one-day periodic behavior and clear intraday pattern. On the contrary, our analysis suggests that the maintaining of 
server should be scheduled before 7:30 a.m. rather than in the wee hours. We also found that the fluctuations of the 
online avatar numbers do not follow a Gaussian distribution. Instead, the distribution is leptokurtic and fat-tailed. 
A maximum likelihood method based on the Kolmogorov-Smirnov statistic shows that the distribution has power- 
law tails. We also employed the (multifractal) detrended fluctuation analysis to investigate the memory effect of the 
increments of online avatar numbers after removing the intraday pattern and the associated absolute values. We found 
that the increments do not possess temporal correlation while the absolute increments are long-term correlated. In 
addition, both time series exhibit multifractal nature. 
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